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Abstract 

We  offer  a  theory  of  economic  fluctuations  based  on  intertemporal  increasing  returns:  agents 
who  have  been  active  in  the  past  face  lower  costs  of  action  today.  This  specification  explains  the 
observed  persistence  in  individual  and  aggregate  output  fluctuations  even  in  the  presence  of  i.i.d 
shocks,  essentially  because  individuals  respond  to  the  same  shock  differently  depending  on  their 
recent  past  experience.  A  feature  of  our  model  is  that  output  growth  follows  an  unobserved 
components  process  with  special  emphasis  on  an  underlying  cyclical  indicator.  The  exact  process  for 
output,  the  sharpness  of  turning  points  and  the  degree  of  asymmetry  are  determined  by  the  form  that 
heterogeneity  takes.  We  suggest  a  general  formulation  for  models  with  latent  cyclical  variables  which, 
under  certain  assumptions,  reduces  to  a  number  of  state  space  models  that  have  been  successfully 
used  to  model  U.S.  GNP.  Using  our  general  formulation  we  find  that  allowing  for  richer  heterogeneity 
enables  us  to  obtain  a  better  fit  to  the  data  and  also  that  U.S.  business  cycles  are  asymmetric,  with 
this  asymmetry  manifesting  itself  as  steep  declines  into  recession.  We  estimate  a  strongly  persistent 
cyclical  component  which  is  not  well  approximated  by  discrete  regime  shifts  nor  by  linear  models. 
Our  estimates  of  the  relative  size  of  intertemporal  returns  needed  to  explain  U.S.  GNP  vary  but  on 
the  whole  are  not  implausibly  large. 
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1.  Introduction 

Aggregate  economic  fluctuations  are  characterized  by  successive  periods  of  high  growth 
followed  by  consecutive  periods  of  low  activity.  The  transitions  between  these  periods  of  high  and 
low  growth  are  often  marked  by  sharp  turning  points  and  considerable  evidence  suggests  that  at  these 
moments  the  stochastic  properties  of  the  economy  change  and  display  asymmetries,  see  inter  alia, 
Neftci  (1984),  Diebold  and  Rudebusch  (1989),  Hamilton  (1989)  and  Acemoglu  and  Scott  (1994).  The 
importance  of  tracking  these  movements  in  the  business  cycle  is  reflected  in  the  considerable  attention 
paid  to  a  variety  of  coincident  and  leading  indicators  (e.g.  Stock  and  Watson  (1989),  and  the  papers 
in  Lahiri  and  Moore  (1991)). 

A  natural  way  to  model  temporal  agglomeration1  and  asymmetries  in  economic  fluctuations 
is  to  assume  non-convexities,  such  as  discrete  choice  or  fixed  costs  at  the  individual  level,  because 
such  non-convexities  imply  that  individuals  concentrate  their  activity  in  a  particular  period.  It  is  this 
implication  which  has  been  analyzed  with  considerable  success  in  the  (S,s)  literature.  However,  while 
the  presence  of  fixed  costs  can  account  for  the  discreetness  of  economic  turning  points,  it  does  not 
naturally  lead  to  persistence  -  once  an  individual  undertakes  an  action  they  are  less  likely  to  do  so 
in  the  near  future.  The  key  reason  is  that  although  the  presence  of  fixed  costs  leads  to  increasing 
returns,  these  are  infratemporal;  the  full  extent  of  economies  of  scale  arising  from  fixed  costs  can  be 
exploited  within  a  period2.  As  a  consequence  persistence  in  aggregate  fluctuations  relies  on 
aggregation  across  heterogeneous  agents:  either  more  agents  investing  in  the  past  increases  the 
profitability  of  investment  for  others  (e.g.  Durlauf  (1991)  and  (1993))  or  aggregate  shocks  affect 
agents  differently,  leading  to  a  smoothed  response  over  time  (e.g.  Caballero  and  Engel  (1991)). 

This  paper  emphasizes  an  alternative  explanation  of  persistent  individual  and  aggregate  output 
fluctuations  by  suggesting  that  fixed  costs  introduce  intertemporal  increasing  returns,  that  is,  the 
returns  from  an  activity  this  period  are  higher  if  the  activity  occurred  last  period  and  as  a  consequence 
an  agent  who  was  active  in  the  recent  past  is  more  likely  to  be  active  now.  In  the  presence  of  such 
intertemporal  linkages  temporal  agglomeration  arises  from  two  sources;  active  agents  maintaining  a 
high/low  activity  level  in  successive  periods  because  of  intertemporal  increasing  returns  and  agents 
acting  together,  either  in  response  to  a  common  shock  or  because  of  strategic  complementarities.  We 
show  how  a  model  of  the  former  explains  a  number  of  empirical  features  of  business  cycle 
fluctuations  and  also  offer  a  framework  which  enables  an  economic  interpretation  of  unobserved 
component  models  of  U.S.  output  which  emphasize  the  underlying  cyclical  indicator. 

The  relevance  of  intertemporal  increasing  returns  depends  on  (i)  whether  individual  behavior 
is  persistent  and  (ii)  whether  firm's  technology  and  costs  contain  important  intertemporal  aspects.  The 


1  We  use  this  term,  following  Hall  (1991),  for  the  bunching  of  high  economic  activity  over  time. 

2  To   overcome  this   problem  empirical   implementations   of  (S,s)   models   sometimes   include   time-to-build 
considerations  or  decreasing  returns  at  high  levels  of  investment,  e.g.  Caballero  and  Engel  (1994). 


answer  to  these  questions  will  vary  depending  upon  the  type  of  activity  under  consideration.  For 
instance,  in  the  case  of  radical  changes  in  the  capital  stock,  e.g.  Cooper  and  Haltiwanger's  (1993) 
automobile  retooling,  intertemporal  effects  are  unlikely  to  be  important,  and  this  is  supported  by  the 
lack  of  persistence  in  these  cases.  However,  Section  2  surveys  micro  evidence  that  firm  level 
investment  is  persistent  and  findings  from  technology  studies,  the  management  science  literature  and 
organizational  theory  which  supports  the  notion  that  many  important  discrete  decisions  (e.g. 
investment  in  new  technology,  product  development,  innovation,  maintenance)  exhibit  some  degree 
of  intertemporal  increasing  returns.  These  empirical  findings  motivate  the  theoretical  model  of  Section 
3  in  which  a  firm  has  to  decide  each  period  whether  to  undertake  both  maintenance  and  investment. 
Maintenance  is  modelled  as  having  two  effects:  (i)  increasing  the  productivity  of  existing 
technologies  and  (ii)  facilitating  the  adoption  of  new  innovations.  The  interaction  of  these  two  roles 
leads  to  intertemporal  increasing  returns:  firms  find  it  profitable  to  maintain  the  newly  adopted 
technologies  and  this  in  turn  reduces  the  costs  of  adopting  future  innovations.  As  a  result,  investment 
costs  are  lower  when  the  firm  has  invested  last  period,  and  a  natural  asymmetry  is  introduced  in 
individual  behavior;  in  response  to  a  range  of  shocks,  the  agents  will  find  it  profitable  to  invest  only 
if  they  have  invested  in  the  recent  past. 

Section  4  examines  the  aggregate  economic  fluctuations  implied  by  individual  level 
intertemporal  increasing  returns,  and  illustrates  how  our  model  accounts  for  the  main  features  of 
temporal  agglomeration  and  asymmetric  fluctuations:  the  significance  and  persistence  of  a  cyclical 
component  in  output  fluctuations,  and  the  importance  of  turning  points  and  business  cycle 
asymmetries.  An  important  feature  of  our  model  is  its  tractability.  Not  only  can  we  characterize  how 
the  persistence  of  business  cycles  and  the  sharpness  of  turning  points  are  determined  but  we  also 
make  direct  contact  with  a  number  of  econometric  studies  which  rely  upon  an  unobserved  component 
model  of  output  growth,  e.g.  Harvey  (1985),  Watson  (1986),  Clark  (1987).  This  generality  enables 
us  to  nest  a  variety  of  different  types  of  business  cycle  asymmetries.  And  this  theme  is  pursued  in 
Section  5  where,  in  the  context  of  our  model,  we  examine  the  circumstances  under  which  turning 
points  between  booms  and  recessions  are  most  marked,  so  that  output  growth  approximates  a  shifting 
regime  process  as  in  the  discrete  state  Markov  model  popularized  by  Hamilton  (1989).  In  all  these 
exercises,  we  find  that  the  form  of  heterogeneity  (the  distribution  of  idiosyncratic  shocks)  is  the  main 
determinant  of  the  time-series  properties,  and  also  of  the  asymmetric  nature  of  fluctuations. 

Section  6  takes  our  model  to  data.  First,  we  look  at  the  connection  between  our  theoretical 
model  and  a  number  of  estimated  regime  shift  models  to  infer  the  size  of  intertemporal  increasing 
returns  necessary  to  explain  U.S.  business  cycles.  However,  the  real  advantage  of  our  formulation  is 
its  ability  to  nest  different  unobserved  component  models  with  different  specifications  regarding  the 
form  of  heterogeneity.  We  thus  estimate  two  versions  of  our  general  model  on  U.S.  data.  This 
exercise  yields  a  number  of  interesting  results.  First,  we  find  that  a  version  of  our  general  model  with 
normally  distributed  idiosyncratic  shocks  does  considerably  better  than  the  version  with  uniformly 


distributed  shocks,  but  this  latter  is  precisely  what  previous  empirical  work  has  used.  Therefore,  our 
model  offers  an  alternative  time-series  representation  grounded  in  economic  theory  that  outperforms 
a  number  of  popular  models.  Moreover,  the  difference  between  the  versions  can  be  heuristically 
thought  to  be  the  importance  of  asymmetry  terms  in  the  version  with  normal  shocks.  Thus,  our 
investigation  also  establishes  that  U.S.  business  cycles  are  asymmetric  and  illustrates  that  these 
asymmetries  manifest  themselves  by  sharp  downswings  in  economic  activity  which  cannot  be  captured 
by  linear  models. 

Since  our  formulation  is  derived  from  an  underlying  economic  model,  we  also  use  the 
estimation  results  to  make  inferences  about  the  nature  of  the  aggregate  economy.  For  instance,  we 
find  that  the  cyclical  dynamics  of  the  U.S.  GNP  cannot  be  characterized  as  discrete  shifts  between 
different  regimes,  which,  in  our  theoretical  model  implies  that  the  idiosyncratic  uncertainty  is  playing 
an  important  role  in  the  propagation  of  economic  shocks  and  this  is  also  confirmed  by  the  fact  that 
we  estimate  the  variance  of  idiosyncratic  shocks  to  be  larger  than  that  of  the  aggregate  shock.  We 
also  again  find  that  the  size  of  non-convexity  implied  by  the  estimation  result  is  modest  and  that 
overall  the  model  provides  a  good  fit  to  the  data. 

2.  Individual  Persistence  and  Intertemporal  Increasing  Returns 

Our  interest  in  temporal  agglomeration  focuses  attention  on  problems  in  which  fixed  costs  are 
important  and  so  agents  have  to  make  a  discrete  choice  about  whether  or  not  to  perform  a  certain 
activity.  A  crucial  feature  of  our  analysis  will  be  the  investigation  of  how  an  agent's  current 
qualitative  choice  (i.e.  whether  or  not  to  perform  a  certain  activity)  influences  the  same  decision 
tomorrow.  In  the  standard  case,  an  activity  that  requires  a  fixed  cost  will  be  bunched  within  a  period 
of  time,  essentially  because  fixed  costs  imply  the  existence  of  infratemporal  increasing  returns  to 
scale.  This  observation  lies  at  the  heart  of  (S,s)  models  (e.g.  Scarf  (1959))  and  implies  that  a  brief 
period  of  activity  is  followed  by  periods  of  inactivity  at  the  individual  level.  However,  while  there 
is  considerable  evidence  supporting  (S,s)  models  (e.g.  Bertola  and  Caballero  (1990),  Caballero  and 
Engel  (1994),  Doms  and  Dunne  (1994),  Cooper,  Haltiwanger  and  Power  (1994)),  there  is  also 
substantial  evidence  that  firm  level  investment  decisions  are  characterized  by  significant  persistence. 
For  instance,  using  UK  firm  level  data,  Bond  and  Meghir  (1994)  find  significant  autoregressive 
effects  in  investment  behavior.  They  estimate  equations  for  the  investment-capital  ratio  and  find  I/K, 
=  a  +  0.856  I,.1/Kt.,(l-0.122  It.,/Kt.1)+pZ,  +  et  where  Zt  is  a  vector  of  firm  relevant  variables  and  e, 
is  a  white  noise  disturbance.  Evaluating  the  quadratic  term  at  its  sample  mean  yields  an  AR(1) 
coefficient  of  around  0.75.  Bond  et  al  (1994)  estimate  AR(1)  and  AR(2)  models  using  Belgian, 
French,  German  and  UK  firm  panel  data  finding  strongly  significant  investment  lags,  with  the  sum 
of  the  autoregressive  coefficients  around  0.3.  Even  the  evidence  in  Doms  and  Dunne  (1994)  reveals 
that  there  is  significant  investment  occurring  in  most  years  of  the  sample  for  each  firm  and  that 
concentrated  investment  bursts  are  spread    over  several  years.  These  findings  suggest  that  while 


infratemporal  economies  of  scale  (lumpiness)  are  important  there  may  also  be  intertemporal  linkages 
which  can  profitably  be  exploited.  This  raises  the  question,  what  form  of  investment  produces  these 
intertemporal  linkages? 

The  most  obvious  form  of  intertemporal  economies  of  scale  is  learning-by-doing,  whereby 
past  experiences  increase  productivity  or  reduce  costs.  More  explicitly,  consider  the  case  where 
incorporating  new  knowledge  is  a  slow  and  costly  process,  limiting  the  degree  to  which  productive 
investments  can  be  undertaken  within  a  period.  However,  the  more  familiar  an  individual  is  with  the 
most  recent  technology  vintages,  the  cheaper  it  is  to  adopt  the  latest  version.  In  contrast  an  individual 
not  using  the  most  recent  vintage  faces  compatibility  problems  when  dealing  with  the  frontier 
technology.  The  result  will  be  that  only  limited  innovative  moves  are  taken  within  each  period,  with 
cost  incentives  making  forward  steps  more  likely  to  come  from  active  agents.  The  idea  that 
intertemporal  linkages  might  lead  agents  to  take  a  sequence  of  small  investment  steps  is  given 
empirical  support  in  studies  of  technological  innovation.  In  this  literature  a  distinction  is  drawn 
between  incremental  improvements  of  an  existing  technology  and  radical  improvements  which  destroy 
the  existing  technology  and  start  an  alternative  production  or  organizational  method,  e.g.  Tushman 
and  Anderson's  (1986)  "incremental  change"  versus  "technological  discontinuities".  There  is 
widespread  agreement  that  the  less  spectacular  incremental  changes  account  for  most  of  the 
productivity  gains  (see  Abernathy  (1980),  Myers  and  Marquis  (1969)  and  Tushman  and  Anderson 
(1986)). 

More  importantly  for  our  paper,  there  is  also  a  consensus  that  incremental  innovations  are 
more  likely  to  come  from  firms  who  have  been  active  at  the  earlier  stages  of  product  development. 
In  Freeman's  (1980,  p.  168)  words  "the  advance  of  scientific  research  is  constantly  throwing  up  new 
discoveries  and  opening  up  new  technical  possibilities,  a  firm  which  is  able  to  monitor  this  advancing 
frontier  by  one  means  or  another  may  be  one  of  the  first  to  realize  a  new  possibility".  Arrow  (1974) 
and  Nelson  and  Winter  (1982)  also  emphasize  the  advantages  possessed  by  incumbent  innovators  in 
being  able  to  further  cope  with  incremental  changes.  For  instance,  Nelson  and  Winter  stress  the 
importance  of  engineers  understanding  new  technologies,  suggesting  that  firms  who  innovate  have 
an  advantage  in  doing  so  again.  Abernathy  (1980,  p.70),  using  evidence  from  industries  diverse  as 
automobiles,  airlines,  semi-conductor  and  light  bulb  manufacturing,  notes  that  "Each  of  the  major 
companies  seems  to  have  made  more  frequent  contributions  in  a  particular  area"  suggesting  that 
previous  innovations  in  a  field  facilitate  future  innovations.  One  possible  explanation  of  these  findings 
are  fixed  effects:  some  firms  may  simply  be  good  at  innovating  in  certain  areas.  However,  the 
industry  wide  work  of  Hirsch  (1952),  Lieberman  (1984)  and  Bahk  and  Gort  (1993)  suggests  that  more 
than  just  individual  fixed  effects  is  operating.  In  the  remainder  of  this  paper  we  shall  focus  on  this 
form  of  investment  and  its  implications  for  aggregate  fluctuations. 
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3.  Individual  Behavior  in  the  Presence  of  Intertemporal  Increasing  Returns 

(i)  The  Environment 

We  assume  that  agents  (or  firms)  are  risk-neutral  and  forward  looking,  and  seek  to  maximize 
their  return  (either  profit  or  utility).  The  key  decision  is  whether  or  not  to  invest  in  order  to  benefit 
from  technological  progress.  More  explicitly,  each  period  a  new  technology  becomes  available.  This 
technology  has  a  stochastic  productivity  that  is  revealed  at  the  beginning  of  the  period.  The  agent 
decides  whether  to  adopt  this  technology  or  not,  for  instance  whether  to  install  a  new  software.  If  the 
technology  is  adopted,  the  productivity  of  the  agent  increases  permanently,  starting  from  the  current 
period3.  To  obtain  the  highest  return  from  this  innovation,  its  compatibility  with  existing  technologies 
needs  to  be  monitored.  In  particular,  at  the  end  of  the  installation  period  there  is  the  option  to  gain 
additional  productivity  from  the  innovation  by  maintenance4  (e.g.  fine-tuning).  Under  these 
assumptions  the  firm's  output  is 

y,=>Vrao+(ai+M/>Va:2y/(1-m/)  0) 

where  a,,,  a,  and  o^  are  positive  parameters  and  s,  and  m,  are  binary  decision  variables  that  equal  1 
if  investment  (st)  and  maintenance  (n\)  are  undertaken  this  period,  and  equal  zero  otherwise.  If  the 
new  technology  is  adopted  (st=l),  the  productivity  of  the  firm  is  higher  permanently.  When  there  is 
no  maintenance  effort  at  the  end  of  the  period  (mt=0),  this  increase  in  productivity  is  not  as  large  as 
it  could  be  (a,-0C2+ut  instead  of  o^+u,).  Deterministic  depreciation  is  denoted  by  Oq  (our  results  are 
unchanged  if  depreciation  is  avoided  when  m,=l).  We  assume  that  u,  is  a  serially  uncorrected  random 
shock  to  the  productivity  of  investment  with  distribution  function  denoted  by  F(.).  Maintenance  costs 
are  assumed  to  be  equal  to  a  positive  constant,  y0  (i.e.  Ctm=y^).  A  distinctive  feature  of  (1)  is  that 
as  in  (S,s)  models,  our  focus  is  on  decisions  that  have  a  discrete  nature,  e.g.  whether  to  adopt  the 
recent  technology  vintage  or  whether  to  maintain  existing  machines.  The  discrete  nature  of  the  choice 
explains  why  the  productivity  shock  in  (1)  is  multiplicative  with  the  investment  decision;  productivity 
shocks  are  associated  with  one  vintage  of  technology  and  therefore  only  affect  output  when  the  new 
technology  is  adopted. 

In  this  model,  maintenance  also  has  an  additional  role  other  than  ensuring  the  maximum 
performance  of  the  latest  innovation.  More  specifically,  we  assume  investment  costs  are  given  by: 

cr(YrY2™,-i>,  (2) 

where  both  y,  and  y2  are  positive  so  that  when  equipment  is  maintained  the  firm's  investment  costs 
next  period  are  lower.  Here  y2  is  the  extra  investment  cost  incurred  by  a  plant  which  did  not  maintain 


Time-to-build  considerations  can  easily  be  incorporated  in  the  return  function  and  only  serve  to  change  the  timing 
of  returns. 

4  See  Pennings  and  Buitendam  (1987)  for  the  importance  of  maintenance  type  activities. 


last  period.  In  terms  of  our  computing  example,  all  existing  bugs  in  the  system  need  to  be  removed 
to  ensure  the  optimal  implementation  of  the  new  software,  implying  that  next  period  the  instalment 
of  another  new  program  is  less  costly.  In  contrast,  if  the  computer  is  not  maintained,  technical 
assistance  is  necessary  both  at  the  beginning  of  the  period  for  installation  (to  remove  bugs)  and  at  the 
end  of  the  period  (for  fine  tuning  to  increase  productivity  if  so  decided). 

Each  period  the  firm  decides  whether  to  invest  (st=0  or  1)  and  also  whether  to  maintain  (m,=0 
or  1).  Denoting  the  discount  factor  of  the  representative  firm  by  P  and  the  per  period  return  by  r(.), 
we  have: 

r(slvjnlv;mtij_vyltj_vulv)= 

>'/y-rao+(ai+M/^-a^v<1"m^"Yom<v-(YrY2m/+;-i>y/+; 

and  the  maximization  problem  of  the  firm  at  time  t  is: 

max  E,{  Jl^r(slv^ntv;ml+j_^H.vut^ }  (4) 

subject  to  (1)  and  taking  y,.,,  ut,  m,.,  as  given.  In  period  t+j,  the  state  variables  are  m,^.,,  ut+j,  yt+j.,  and 
the  choice  variables  are  sI+j  and  m,+j. 

Whether  the  firm  invests  in  any  period  depends  on  ut.  In  contrast,  the  return  to  maintenance 
only  depends  on  whether  or  not  investment  occurs.  If  no  current  investment  is  undertaken,  the  only 
benefit  of  maintenance  is  the  cost  reduction  that  it  brings  in  the  following  period  if  investment  then 
occurs.  In  contrast,  if  there  is  current  investment,  future  productivity  also  increases  by  ctj.  As  a  result 
there  are  three  possibilities  regarding  maintenance:  (i)  always  maintain  (ii)  never  maintain,  (iii) 
maintain  only  when  there  is  investment.  Because  we  wish  to  make  maintenance  a  decision  of  the  firm 
rather  than  a  fixed  characteristic,  we  concentrate  on  (iii).  To  this  end  we  assume5  : 

Assumption  A: 

0Y  <Y  <A_  (5) 

H  2      °    1-/3 

The  first  inequality  can  be  understood  by  noting  that  Py2  is  the  maximum  benefit  from  maintenance 
in  the  absence  of  current  investment:  if  there  is  investment  next  period  costs  are  lower  by  y2, 
otherwise  there  is  no  benefit.  Therefore,  the  first  part  of  the  inequality  implies  maintenance  is  not 


Assumption    A   is    stronger   than    we   require   but   simpler   to    understand    than    the    necessary    condition, 
j3y2I         dF(u)<y0< +/3y7  |         dFCu)   ,  where  a\,  and  go,  are  constants  defined  below. 


worthwhile  just  to  obtain  future  cost  savings.  In  the  presence  of  current  investment,  the  minimum  gain 
from  investment  is  the  present  value  of  the  productivity  increase  due  to  maintenance,  (^/(l-P). 
Consequently  the  second  part  of  the  inequality  states  that  even  without  future  cost  savings,  it  is 
profitable  to  maintain  if  there  is  current  investment.  It  follows  that  when  (5)  holds,  we  can  limit  our 
attention  to  the  case  where  the  firm  only  maintains  when  it  invests,  thus  mt=s„  and  the  per  period 
return  simplifies  to: 

Kv^iVJ^^ai^"Mv^  (6) 

where  50=Y0+Yi  and  5l=y2-  Assumption  A  therefore  enables  us  to  write  our  problem  in  a  way  which 
focuses  on  the  intertemporal  increasing  returns  arising  from  the  interactive  term  5,st+jst+j.,.  More 
generally,  (6)  can  be  interpreted  as  the  period  return  for  a  firm  which  benefits  from  lower  current 
investment  costs,  if  it  invested  last  period.  Under  this  interpretation  our  model  captures  the  idea  that 
firms  who  are  used  to  adapting  to  a  changing  environment  face  lower  costs  of  further  changes. 

(ii)  Optimal  Decision  Rules 

Using  (6)  the  maximization  problem  at  time  t  can  be  written  in  a  dynamic  programming  form 
with  the  value  function  V(.)  satisfying: 

V(yt_^t_vu)=sup  {/-(^;y/.1^.1,u/)+/3£;/K(y/.1-a0+(a1+v/>s^^/+1)}  (7) 

Solving  the  agent's  optimization  problem  we  obtain  (proof  in  the  appendix): 
Proposition  1:  The  value  function 

sr\    and    ^(yM^M,u><V<^M+<to-i+te 

*/"^<«VwrVi  (8) 

s=0    and    Vfy,.^,.^,)^^,-! 

if    K^Qo-to^ 

is  the  unique  function  that  satisfies  (7)  (<|)'s  denote  constant  coefficients). 

To  understand  the  dichotomous  nature  of  the  value  function,  consider  the  case  where  the  agent 
does  not  invest  (st=0).  The  disturbance  ut  is  irrelevant  to  future  profits  and  with  no  investment  costs, 
it  does  not  matter  whether  the  firm  invested/maintained  last  period  and  so  the  value  function  depends 
only  upon  y,.,.  However,  if  the  firm  invests  the  value  function  depends  linearly  upon  yt.„  st.j  and  ut. 
The  optimal  choice  of  st  depends  on  whether  the  investment  shock,  u„  is  above  a  certain  critical 
value.  Shocks  below  this  value  lead  to  insufficiently  high  productivity  to  be  embodied  in  the  firm's 
technology.  The  critical  value  depends  on  st.,  due  to  the  intertemporal  non-separability  in  the  cost 
function.  To  understand  what  determines  these  critical  values  note  that  the  firm  invests  this  period 


iff  the  difference  between  the  left  hand  side  of  (7)  evaluated  at  s,=l  and  0  is  positive.  Using  the 
Appendix  and  (8),  the  agent  invests  iff 

(Jg  oo  <">0 

[1-/8  /  ^a'xl^o,^   /  dF(uhl)+J-  /  WM^(u,J]+SlVl+rU,>0       (9) 

It  is  this  inequality  which  implicitly  defines  the  coefficients  co0  and  co,  in  (8)  (see  (A3)  in  the 
Appendix).  The  intuition  behind  this  expression  is  a  good  way  of  illustrating  the  main  features  of  our 
model.  The  firm  is  comparing  the  strategy  s,=l  with  st=06.  If  st=l  production  increases  by  oc,+ut  for 
all  periods  compared  to  s,=0,  which  has  a  net  present   value  (including  costs)  of 

(i-/?)>1+u,)-(S0-SrVi)  (10> 

Any  further  benefits  from  choosing  st=l  depend  upon  future  values  of  ut.  There  are  three 
possible  cases: 

(i)  If  ut+1e  (-oo,(00-C0i)  the  agent  will  not  invest  regardless  of  st  and  there  are  no  consequences 
beyond  (10). 

(ii)  If  ut+1e  [cOq-co^cOo)  the  household's  t+1  investment  decision  depends  upon  st.  The  shock 
is  only  favorable  enough  for  investment  if  the  household  benefits  from  cost  savings  arising  from  past 
investment  i.e.  st+1=l  if  st=l  and  st+I=0  otherwise.  In  this  case,  investing  today  means  a  difference  in 
expected  discounted  value  next  period  of 

u°  <->o 

P  {  dF(um)x{(l-P)\a0+  f  u^dFiu^y^-SJ}  (11) 


u0-u)[ 


where  the  first  integral  represents  the  probability  that  ut+1e  [co0-cO|,co0)  and  the  second  is  the  expected 
value  of  u,+1  conditional  on  ut+1e  [o)0-a),,{00).  If  both  ut+1  and  ut+2  fall  in  the  region  [(%-<»,, co0)  this  same 
additional  benefit  accrues  in  t+2,  suitably  discounted.  In  other  words,  st+2  only  equals  1  when  st+1=l, 
which  in  turn  will  only  be  the  case  when  st=l.  As  {ut}  is  assumed  an  i.i.d  sequence  this  additional 
benefit  at  t+2  is  (1 1)  multiplied  by  pProb(co0-a>,<ut+1<a>0).  A  similar  logic  holds  for  all  future  periods 
and  summing  over  time  yields 

{1-0  /  dF(uhl)}lx[{p  /  iF(uM)}x{^+5rS0}+J-  /  u^dFiu,^)]  (12) 


Our  results  are  unchanged  if  m,  and  s,  lie  in  the  continuum  [0,1]  rather  than  take  discrete  values.  In  this  case,  if 
(9)  holds  as  a  strict  inequality  agents  choose  one  corner,  s,=l  and  mt=l;  if  (9)  is  strictly  negative,  st=mt=0.  If  (9) 
holds  as  an  equality  agents  are  indifferent  between  any  choice  that  has  m,=st.  If  in  this  case  we  impose  that  the  agent 
chooses  s,=m,=l  our  results  hold  exactly. 
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Equation  (12)  is  the  net  expected  value  of  all  future  investment  projects  the  firm  benefits 
from  only  if  it  invests  today.  If  the  firm  does  not  invest  at  time  t,  and  future  shocks  fall  in  the  region 
[co0-co,,to0),  the  firm  will  not  invest  in  the  future.  However,  if  the  firm  invests  today  the  same 
sequence  of  shocks  leads  the  firm  to  invest  and  reap  the  benefits  given  by  (12).  Thus  there  is  an 
important  asymmetry  in  the  way  firms  respond  to  investment  shocks  in  high  and  low  activity  states, 
and  as  a  consequence  the  marginal  propensity  to  invest  varies  between  these  states.  This  state 
dependence  relies  entirely  on  0),>0,  which  from  (A3)  in  the  appendix  is  equivalent  to  8,>0,  or  in  other 
words,  the  presence  of  intertemporal  increasing  returns. 

(iii)  Finally,  if  ut+1e  [co0.°°).  the  shock  is  so  favorable  agents  invest  regardless  of  whether  they 
benefit  from  lower  costs.  However,  while  investment  decisions  are  the  same  irrespective  of 
s,.„  costs  are  not.  If  st  =  1  the  cost  of  choosing  st+1=l  is  lower  by  8„  which  has  expected  present 
value  of  (38,(l-F(co0)).  The  same  benefit  accrues  at  t+2,  if  both  u,+1e  [©0-co,,co0)  and  ut+2e  [co0,°°),  with 
expected  value  of  f3281[F(a)0)-F(co0-co,)][l-F(aj0)],  with  similar  expressions  holding  for  t+3,  etc. 
Summing  over  time  gives 

Wq  CO 

{1-/3  /  dF(utJY^jdF{utJ  (13) 

"o-"i  "o 

This  expression  represents  the  reduction  in  future  costs  arising  from  current  investment  and  again 
reflects  the  persistence  in  s„  captured  by  the  integral  between  (£>q-(jsx  and  u)0. 

The  sum  of  (10),  (12),  and  (13)  is  equal  to  (9)  and  characterizes  the  optimal  decision  rule  of 
households.  The  most  important  feature  of  the  decision  rule,  summarized  by  (9),  is  the  dependence 
of  current  actions  on  past  decisions.  Due  to  intertemporal  increasing  returns,  shocks  in  the  range 
[co0,co0-a>,)  lead  to  investment  if  received  by  an  agent  who  has  been  active  in  the  past  (s,.,=l)  but  not 
for  an  agent  who  has  not  invested  at  t-1,  making  individual  behavior  persistent. 

4.  Cyclical  Fluctuations  in  the  Aggregate  Economy 

(i)  Characterizing  Output  Fluctuations 

We  now  turn  to  the  implications  of  individual  level  intertemporal  increasing  returns  for 
aggregate  economic  fluctuations.  We  assume  the  economy  consists  of  a  continuum  of  agents, 
normalized  to  1,  each  of  whom  faces  the  technology  described  above.  Because  the  intertemporal 
increasing  returns  are  internal  to  the  firm,  we  allow  for  correlation  between  the  different  investment 
decisions  (denoted  s,'  for  agent  i)  of  heterogenous  agents  by  assuming  the  existence  of  an  aggregate 
productivity  shock  vt.  Heterogeneity  is  first  introduced  via  an  idiosyncratic  shock,  £,'  such  that  each 
agent  receives  a  shock  u^Vj+e,'.  We  assume  that  £,'  is  drawn  from  a  common  distribution  function 
G(.)  (with  associated  density  function  g(.))  and  that  v,  is  i.i.d  with  distribution  function,  H(.),  and 
density  function,  h(.).  Finally,  we  assume  £,'  is  uncorrected  between  individuals  and  over  time.  Both 
shocks  are  normalized  to  have  zero  mean  and  are  assumed  to  be  observed  before  agents  make  their 


investment  decisions. 

The  decision  rule  of  the  individual  is  given  in  section  3  and  takes  the  form  of  invest  iff  ut'>(00- 
cOjS,.,'.  Conditioning  on  the  aggregate  shock,  this  can  be  written  as,  invest  iff: 

€t>w0-(i>1s;_rvt  (i4) 

where  co0  and  to,  are  defined  by  the  distribution  of  ut\  F(.).  Investment  decisions  vary  among  firms 
due  to  the  idiosyncratic  shock,  requiring  st  to  be  indexed  by  i.  Defining  St  as  the  proportion  of  agents 
that  invest  in  period  t  (equivalently,  the  aggregate  propensity  to  invest),  we  have: 

Proposition  2: 

Aggregate  output  follows  the  process 

Ay/=|Ay/^=a°+a15/+    f    u/di 
ie[s'=i\ 

oo  "nV,  (15) 


=a°+(a1+v/)V  /  «8(*>k+  S„     f    eg(e)de 


where 


S,={1-G(<vvJ>(1-SmM1^(<Vwi^,)>*m  (l6) 

={l-G(<D0-v,)}+{GK-v>G(Q0-<Drv,)}SM 


To  understand  (16),  note  that  of  the  S,.,  firms  who  invested  last  period  only  those  with  an 
idiosyncratic  shock  greater  than  cOo-co^v,  will  invest  now.  Therefore,  the  measure  of  firms  who  invest 
in  two  successive  periods  is  (l-G(co0-co,-vt))St.1.  Of  the  (1-S,.,)  firms  who  did  not  invest  only  those 
with  an  idiosyncratic  shock  greater  than  co0-vt  will  do  so  now  and  their  measure  is  (l-G(co0-v,))(l-St.1). 
Therefore,  as  shown  in  Fig.l,  St  is  a  weighted  average  of  points  on  the  distribution  function  of 
idiosyncratic  shocks,  where  the  location  of  these  points  depends  upon  the  aggregate  shock  and  the 
weights  depend  on  St.,. 

(ii)  The  Nature  of  Business  Cycle  Fluctuations 

Proposition  2  outlines  an  unobserved  components  model  for  GNP,  as  the  law  of  motion  for 
output  consists  of  both  a  measurement  equation,  (15),  and  a  state  equation,  (16).  The  importance  of 
the  state  equation  is  that  due  its  non-linear  nature,  it  enables  us  to  explain  the  asymmetries  and  the 
temporal  agglomeration  discussed  in  the  introduction.  St  represents  the  average  number  of  firms  who 
are  investing  and  is  most  naturally  interpreted  as  the  cyclical  component  of  output,  which  is  crucial 
in  generating  output  fluctuations  (e.g.  Watson  (1986),  Hamilton  (1989)).  Variations  in  St  not  only  alter 
the  growth  rate  of  output  via  (15),  but  also  provide  persistence  because  S,  follows  a  time  varying 
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AR(1)  process  with  autoregressive  coefficient  equal  to  G(o)0-v,)-G(to0-co1-vt).  Persistence  is  caused  by 
shocks  in  the  region  [co0-u)„a)0):  in  the  case  where  st'=0  a  value  of  ut+1  in  this  region  means  it  is 
optimal  for  sl+1'=0,  whereas  with  st'=l,  st+I'=l  would  be  optimal.  Therefore,  agents  who  invested  last 
period  have  a  higher  propensity  to  invest  this  period  than  those  who  did  not.  In  the  aggregate  this 
implies  that  i.i.d  shocks  are  converted  into  persistent  cyclical  fluctuations.  The  autoregressive  nature 
of  the  cyclical  component  S,  relies  upon  8,  being  non-zero  and  so  depends  entirely  on  intertemporal 
increasing  returns,  if  8,=0  S,  is  white  noise.  The  need  for  an  unobserved  components  approach  arises 
from  the  fact  that  monitoring  output  growth  requires  tracking  the  number  of  agents  who  are/are  not 
investing.  This  is  needed  because  each  type  of  agent  responds  to  shocks  differently,  each  group 
displaying  persistence  in  their  behavior.  As  a  result,  it  is  not  sufficient  to  monitor  only  the  shocks 
which  impact  on  the  economy,  but  also  who  will  be  affected  by  them. 

The  AR  parameter  in  (16)  is  in  general  time  varying  and  this  feature  enables  us  to  account 
for  the  empirical  findings  of  business  cycle  asymmetries;  essentially,  the  impact  effect  of  aggregate 
shocks  on  output  varies  over  the  business  cycle.  Referring  back  to  Fig.l,  changes  in  vt  shift  the 
position  of  the  two  points  along  the  horizontal  axis:  a  high  vt  shifts  the  chord  AB  down  and  St  will 
be  higher.  However,  the  exact  impact  that  vt  has  depends  upon  both  S,.,  and  the  cross  sectional 
distribution  of  shocks,  i.e.  upon  both  the  slope  of  the  chord  AB  (which  is  determined  by  G(.)  and  Vj) 
and  the  weights  on  the  two  points  (determined  by  S,.,).  As  a  result,  the  non-linear  autoregressive  form 
of  (16)  is  a  source  of  path  dependence  in  our  model  as  well  as  persistence;  a  shock  which  changes 
St_,  both  affects  St  through  the  AR  coefficient  but  also  alters  the  way  that  the  economy  responds  to 
future  shocks  due  to  the  interaction  between  vt  and  St_,. 

However,  under  the  assumption  of  uniformly  distributed  idiosyncratic  shocks  these  asymmetric 
interactions  between  St_,  and  vt  are  absent.  In  this  case  the  AR  coefficient  in  (16)  is  constant  and  (15) 
and  (16)  approximate  the  standard  "return  to  normality"  state  space  model,  variants  of  which  have 
been  used  to  successfully  model  U.S.  GNP  by  Harvey  (1985),  Watson  (1986)  and  Clark  (1987)7. 
More  generally,  (15)  and  (16)  allow  for  a  number  of  alternative  component  based  models  which 
account  for  a  wide  range  of  asymmetries  and  cyclical  fluctuations.  Each  of  these  different  models  of 
the  cycle  arise  from  alternative  assumptions  on  the  distribution  of  idiosyncratic  shocks.  However, 
although  our  model  allows  an  important  role  for  heterogeneity,  we  do  not  need  to  monitor 
complicated  changes  in  the  cross-sectional  distributions  to  keep  track  of  the  state  of  the  economy: 
what  matters  for  business  cycles  is  not  the  exact  relative  position  of  each  agent  over  time  but  the 
distribution  function  for  idiosyncratic  shocks  around  particular  ranges.  The  latter  serves  as  a  sufficient 
statistic  for  all  distributional  issues,  considerably  simplifying  the  analysis  of  the  next  subsection. 


The  uniform  distribution  case  only  approximates  the  return  to  normality  model  due  to  the  non-normality  of  the 
measurement  equation  disturbance  and  because,  if  v,  is  such  that  either  o^-v,  or  a^-Wi-v,  is  outside  the  support  of 
£j,  the  autoregressive  coefficient  is  no  longer  constant. 
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Motivated  by  these  observations,  we  now  turn  to  investigate  how  the  degree  of  intertemporal 
increasing  returns  and  heterogeneity  interact  to  determine  the  time  series  properties  of  the  aggregate 
economy,  and  of  specifically  the  cyclical  component  St. 

(Hi)  Determinants  of  the  Time  Series  Properties  of  the  Business  Cycle 

The  non-linear  nature  of  our  model  means  there  is  no  unique  definition  of  persistence,  but  one 
candidate  is  the  degree  of  serial  correlation  in  S,  conditional  upon  vt: 


Given  G(.)  is  a  distribution  function  we  have: 

Corollary  1:  Business  cycle  persistence  is  non-decreasing  in  8,,  the  degree  of  intertemporal  increasing 
returns. 

Persistence  is  due  to  the  fact  that  some  agents  have  shocks  in  the  interval  [(D0-g>i>  co0)  and 
invest  in  this  period  only  because  investment  costs  are  lower  due  to  recent  high  activity.  As  co, 
determines  the  measure  of  these  marginal  agents,  and  is  itself  a  positive  function  of  8„  serial 
correlation  is  strengthened  when  intertemporal  increasing  returns  are  higher. 

To  illustrate  how  the  nature  of  the  business  cycle  varies  with  different  degrees  of  increasing 
returns  consider  the  following  illustrative  simulation.  Assume  aggregate  and  idiosyncratic  uncertainty 
to  be  equally  important  with  both  having  a  variance  of  0.25,  the  former  being  normally  and  the  latter 
uniformly  distributed,  and  let  the  gains  from  learning-by-doing,  a,,  be  equal  to  1.52  (see  Section  6.1 
for  a  justification  of  this  choice).  Figs.  2  and  3  show  the  cyclical  component  arising  from  these 
assumptions  for  the  case  5,/50=l/3  and  1/2.  Given  the  strong  path  dependence  in  our  system  Figs.2 
and  3  are  drawn  for  the  same  (suitably  scaled)  sequence  of  random  shocks.  Both  figures  illustrate  how 
intertemporal  increasing  returns  convert  i.i.d  shocks  into  cyclical  fluctuations,  however,  the  cyclical 
indicator  is  far  more  persistent  in  Fig.3  than  in  Fig.2.  The  increased  learning-by-doing  persuades  firms 
to  continue  to  invest  even  in  the  presence  of  mediocre  productivity  shocks,  significantly  reducing  the 
noise  in  the  cyclical  component. 

To  understand  the  impact  of  distributional  affects  on  the  cyclical  component  we  turn  to  a  more 
general  measure  of  persistence  than  (17),  which  was  defined  conditional  on  a  given  value  of  vt. 
Integrating  across  all  possible  values  of  the  aggregate  shock,  we  arrive  at  a  global  measure  of 
persistence; 

H-zrkww  (18) 


asM 
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Taking  a  first-order  Taylor  expansion  of  P  around  m,,  the  mean  of  v,  which  is  assumed  zero,  we  obtain 

By  definition  the  second  term  is  zero,  and  if  we  take  a  further  first-order  Taylor  expansion  of  G(.) 
around  co0,  P  can  be  approximated  by  g(oo0)co,.  Thus  we  can  state 

Corollary  2:  For  given  (00  and  co,,  an  increase  in  g((00)  will  increase  P. 

The  intuition  behind  this  corollary  is  again  related  to  the  fact  that  individual  persistence  arises 
from  shocks  in  the  region  [co0,  (Oq-co,).  In  the  aggregate  economy  the  distribution  of  idiosyncratic 
shocks  is  important  because  it  determines  the  density  of  agents  who  are  around  this  critical  region. 
For  our  global  measure  of  persistence,  g(co0)co,  is  a  measure  of  the  number  of  such  marginal  agents, 
so  that  the  higher  is  g(co0)  the  greater  is  serial  correlation.  Corollary  2  therefore  implies  that  spreads 
of  g(.)  around  co0  (which  will  often  be  produced  by  increases  in  heterogeneity,  represented  by 
increases  in  the  variance  of  e')  will  reduce  persistence  to  the  extent  they  lower  the  number  of  agents 
in  the  region  [co0,  co0-co,).  The  intuition  of  Corollary  2,  that  the  persistence  in  the  dynamics  of  this 
economy  are  linked  to  the  form  of  the  distribution  of  the  idiosyncratic  shocks,  is  important  for  our 
results  in  section  6,  where  we  will  investigate  the  empirical  performance  of  different  versions  of  this 
general  models  distinguished  by  the  form  of  the  heterogeneity  and  thus  by  the  degree  of  asymmetry 
in  the  law  of  motion  for  output  growth. 

Now,  to  further  analyze  the  impact  of  aggregate  uncertainty  on  the  business  cycle  we  take  a 
second-order  Taylor  expansion  of  (17)  around  u^  followed  by  an  additional  first-order  Taylor 
expansion  around  co0.  This  gives: 

P^Gi^yGi^-^yigX^yg'^o-^Wariy,) 

s%o)^(fflo-ui)V'WuM) 

so  we  have; 

Corollary  3:  Increases  in  the  variance  of  aggregate  shocks  reduce  (increase)  the  persistence  of  the 
cyclical  component  if  g(.)  is  concave  (convex)  around  oo0. 

Corollary  3  states  the  surprising  result  that  for  a  large  class  of  idiosyncratic  distributions,  the 
more  volatile  are  aggregate  investment  shocks,  the  less  important  is  the  business  cycle  -  in  the  sense 
that  the  cyclical  component  becomes  less  persistent  and  aggregate  output  fluctuations  are  increasingly 
driven  directly  by  vt  and  not  by  the  state  equation.  The  intuition  behind  this  is  that  when  g(.)  is 
concave  around  a>0,  increased  volatility  of  the  aggregate  shock  leads  to  the  critical  investment 
threshold  being  located  at  points  with  low  density.  In  contrast,  in  the  case  of  a  convex  g(.)  function 
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at  co0,  more  aggregate  variability  will  take  us  to  values  of  the  density  function  that  are  on  average 
higher  and  thus  increase  serial  correlation  due  to  the  increased  weight  of  marginal  agents. 

(iv)  Structural  Heterogeneity 

The  purpose  of  this  subsection  is  to  show  that  when  we  extend  our  model  to  allow  for 
different  types  of  heterogeneity  across  agents,  the  importance  of  the  cyclical  component  and  the 
degree  of  non-linearities  associated  with  the  business  cycle  may  be  enhanced.  In  particular  we  show 
that  increasing  the  dispersion  of  agents  can,  under  certain  conditions,  increase  the  amount  of 
persistence  provided  by  our  cyclical  indicator.  Given  that  {St}  represents  the  extent  of  co-movement 
between  agents,  this  is  a  surprising  result  which  re-iterates  the  limitations  of  representative  agent 
models. 

We  have  so  far  only  considered  what  may  be  termed  stochastic  heterogeneity  using  the 
terminology  of  Caballero  and  Engel  (1991),  that  is  heterogeneity  in  the  form  of  idiosyncratic  shocks. 
Whereas  in  practice,  agents  also  differ  significantly  in  their  technological  possibilities/preferences  as 
well  as  the  opportunities  they  face,  often  captured  in  microeconometric  work  by  the  inclusion  of  fixed 
effects.  To  include  these  influences  we  consider  structural  heterogeneity  by  allowing  investment  cost 
functions  to  be  firm  specific.  We  assume  agent  i  has  investment  costs  given  by8 

Using  the  solution  outlined  in  Section  3  each  agent  invests  iff 

eJ^Oo-w^i-v,  (22) 

where  the  distribution  function  of  co0'  is  T(.)  with  support  set  U  and  is  determined  from  the 
distribution  of  the  cost  parameter,  80'.  Increases  in  the  degree  of  structural  heterogeneity  are  equivalent 
to  mean-preserving  spreads  of  IX).  The  law  of  motion  for  S,  is  now 

^KlA1){/(l-G("i.-v/)yr(a)i))}+5/.1{/(l-G(a)i)-o>1-v/)yr(a>i))} 

u  u  (23) 

={/(l-G(o);-v/))dr(a>;)}+5M{/{G(a);-v/)-G(a)^-(o1-v/)Mr(a);)} 
u  u 

Applying  the  same  two  successive  Taylor  expansions  as  in  the  last  subsection,  our  global  measure 
of  persistence  becomes 

P~f„  [^-Gtoj-aOWui-tt!/,,  S(*>uW<d0)  (24) 

Corollary  4:  If  g(.)  is  convex  (concave),  mean  preserving  spreads  of  T(.)  increase  (decrease)  P. 


The  intertemporal  increasing  returns  parameter,  5,,  can  similarly  be  made  individual  specific. 
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Therefore,  an  increased  dispersion  of  agents  in  the  form  of  greater  structural  heterogeneity  can 
interact  with  stochastic  heterogeneity  to  increase  the  persistence  generated  through  the  cyclical 
component.  The  intuition  here  is  that  if  g(.)  is  convex,  the  averages  of  neighboring  points  will  be 
higher  than  g(.)  itself.  Since  the  higher  is  g(.)  the  higher  is  the  measure  of  agents  in  the  critical  region 
where  investment  is  only  profitable  when  costs  are  low,  in  the  case  of  convex  g(.),  the  presence  of 
structural  heterogeneity  leads  to  a  more  serially  correlated  process  for  {St}.  We  can  therefore  see  that 
a  mean-preserving  spread  (an  increase  in  structural  heterogeneity),  by  extending  the  cross  sectional 
distribution  of  co0'  to  regions  of  g(.)  that  are  higher,  increases  persistence  in  the  economic  state. 
Conversely  when  g(.)  is  globally  concave,  an  increase  in  structural  heterogeneity  will  reduce 
persistence.  The  message  from  this  section  is  clear,  far  from  removing  the  non-linear  nature  of  the 
business  cycle,  heterogeneities   have  a  key  determining  influence  on  economic  fluctuations. 

5.  Regime  Shifts  in  Economic  Fluctuations 

One  of  the  attractions  of  fixed  cost  models  is  that  the  discrete  individual  behavior  they  imply 
can  explain  the  sharpness  of  business  cycle  turning  points.  While  our  general  model  implies  that  the 
cyclical  component  is  always  serially  correlated,  it  does  not  impose  any  conditions  on  the  nature  of 
turning  points.  A  number  of  studies  (e.g.  Hamilton  (1989),  Acemoglu  and  Scott  (1994),  Suzanne 
Cooper  (1994),  Diebold  and  Rudebusch  (1994))  have  achieved  empirical  success  in  modelling  output 
fluctuations  by  assuming  the  business  cycle  to  be  characterized  by  regime  shifts,  that  is  by  abrupt 
moves  from  recession  to  expansion  (or  vice  versa).  In  the  next  subsection  we  investigate  the  factors 
which  determine  the  nature  of  turning  points  in  our  model,  establishing  the  circumstances  under  which 
models  with  regime  shifts  serve  as  a  good  approximation. 

(i)  The  Nature  of  Turning  Points 

To  analyze  this  issue  we  use  our  basic  model  with  only  stochastic  heterogeneity,  and  focus 
on  the  probability  of  a  given  S,  occurring  conditional  upon  S,.,,  which  is  : 

p(5|5> *G2 (25) 

(i-s^K-v)+s'sK-<vv) 

where  p(S  |S')  denotes  the  density  of  St  conditional  on  S,.,=S'  and  v,  is  written  as  an  implicit  function 
of  S,  by  (16). 

The  extreme  case  of  regime  shifts  corresponds  to  the  case  where  St=0  or  1  so  that  p(S  |S') 
has  its  mass  concentrated  at  two  particular  points.  Similarly  in  more  general  characterizations,  if 
p(S  |S')  has  marked  peaks,  then  transitions  between  different  states  take  on  the  character  of  regime 
shifts.  A  necessary  (but  not  sufficient)  condition  for  p(S|S')  to  have  its  mass  concentrated  in  two 
different  intervals  is  that  it  should  be  non-unimodal.  Thus,  if  we  can  establish  p"(S  |S')  is  positive 
at  p'(S  |S')=0,  we  will  have  located  a  local  minimum  which  implies  p(S  |S')  cannot  be  single  peaked. 
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From  (25)  we  have: 

While  no  general  statement  can  be  made  regarding  the  transition  between  states  we  have  the 
following: 

Proposition  3:  The  conditional  density  of  S,  is  non-unimodal  if  g(e)  is  more  concave  than  h(v)  in  the 
neighborhood  of  p'(S  |S')=0. 

Proposition  3  suggests  turning  points  tend  to  be  abrupt  when  g(.)  is  locally  more  concave  than 
h(.).  Though  this  is  not  a  necessary  condition,  naturally  when  g(.)  has  its  mass  concentrated  at  a 
particular  point,  it  will  tend  to  be  more  concave.  This  can  be  seen  in  Fig.l.  If  the  idiosyncratic  shock 
is  uniformly  distributed,  G(.)  is  a  straight  line  and  output  growth,  though  persistent,  is  distributed 
uniformly  along  the  continuum  (a^OofCC,).  However,  if  the  distribution  function  is  concentrated  in 
the  middle  (as  in  Fig.l)  economic  states  become  more  distinct  in  the  sense  that  the  conditional 
distribution  of  {S,}  is  concentrated  in  particular  intervals  of  (0,1).  As  a  consequence,  turning  points 
are  more  likely  to  approximate  the  discreetness  of  regime  shifts. 

To  investigate  further  this  point,  we  utilize  a  simulations  approach.  Recall  that  Fig.3  showed 
the  case  where  8,/80=l/2  and  the  aggregate  and  idiosyncratic  shocks  were  equally  uncertain  with  a 
variance  of  0.25.  Fig.  4  maintains  the  degree  of  increasing  return  at  the  same  level  but  increases  the 
variance  of  the  idiosyncratic  shock  to  2.5.  The  very  different  cyclical  patterns  in  the  two  figures  are 
readily  apparent:  in  Fig.3,  there  are  never  less  than  60%  of  agents  investing  so  that  even  in  recession 
large  numbers  of  agents  are  still  active,  and  turning  points  are  extremely  sharp,  particularly  the 
observations  at  around  37  and  73.  In  contrast,  the  greater  importance  of  idiosyncratic  shocks  adds  a 
considerable  amount  of  noise  to  the  cyclical  indicator  in  Fig.4.  And  while  the  turning  points  at 
observations  37  and  73  can  still  be  detected,  they  represent  only  two  of  several  observations  where 
the  cyclical  component  changes  direction. 

The  above  results  and  discussion  show  again  that  the  form  of  heterogeneity  is  crucial,  this 
time  in  determining  the  sharpness  of  turning  points  and  they  also  suggest  that  the  limiting  case  where 
aggregate  uncertainty  is  much  more  important  than  idiosyncratic  uncertainty  is  useful  to  analyze  as 
a  benchmark. 

(ii)  A  Model  of  Regime  Shifts 

Proposition  3  and  related  simulations  suggest  that  the  more  concentrated  the  idiosyncratic 
distribution  the  more  appropriate  is  a  regime  shift  characterization.  Further,  Corollary  2  implies  that 
increases  in  heterogeneity  may  reduce  the  persistence  of  cyclical  fluctuations  by  reducing  g(o)0).  This 
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suggests  that  the  business  cycle  will  display  extreme  regime  shifts  and  possess  a  highly  persistent 
cyclical  component  in  a  version  of  our  model  with  no  heterogeneity.  In  this  subsection  we  focus  on 
such  a  model  by  setting  ut'=vt.  The  particular  interest  in  this  special  case  is  that  it  offers  a  theoretical 
justification  for  the  widely  used  discrete  Markov  state  space  models. 

Because  the  model  only  contains  an  aggregate  shock,  the  laws  of  motion  for  the  aggregate 
economy  are  the  same  as  those  for  the  individual  firm.  From  our  results  of  Section  3(ii)  and 
introducing  the  notation 

Prob[St=l\St^\]=p 
Prob[St=Q\St_^]=q 

we  have 

Proposition  4:  The  stochastic  process  for  the  change  in  aggregate  output  is 

AY,=-a0+(a1+v,)S,  (28) 

where  St  is  a  Markov  chain  with  transition  matrix 

p   M 


T 


(29) 


and 


p=  f  dH(v)  ,    q=fdH(y)  (30) 


<■>„-&>, 


As  in  the  model  with  heterogeneity,  cyclical  fluctuations  are  the  result  of  random  shocks 
hitting  the  economy.  These  shocks  are  propagated  by  intertemporal  increasing  returns  in  a  manner 
which  requires  a  state  space  formulation  for  output  growth.  The  distinguishing  feature  of  (28)-(30) 
is  that  fluctuations  take  the  form  of  shifts  between  distinct  economics  states.  In  each  of  these  states 
the  economy  behaves  differently,  not  only  does  the  growth  rate  differ  (oc,?K))  but  if  p*l-q  then  so  do 
the  durations  of  booms  and  recessions,  features  common  with  the  model  of  Hamilton  (1989)  which 
closely  corresponds  to  (28)-(30). 

This  regime  shift  model  also  shares  a  number  of  similarities  with  the  model  of  Durlauf  (1993) 
who  also  explicitly  models  the  transition  between  states  of  the  business  cycle  and  focuses  on 
technology  adoption.  In  both  models  this  choice  involves  a  non-convexity  and  intertemporal 
increasing  returns  to  scale  with  the  end  result  being  that  white  noise  productivity  shocks  are  converted 
into  serially  correlated  output  fluctuations.  Both  models  also  suggest  a  strong  role  for  path 
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dependence,  in  (28)-(30)  shocks  which  shift  the  economy  from  one  state  to  another  are  highly 
persistent  as  they  affect  the  way  the  economy  responds  to  future  shocks.  However,  a  key  difference 
between  ours  and  Durlauf  s  work  comes  in  the  form  that  the  intertemporal  non-separability  takes.  In 
Durlauf  s  model  the  intertemporal  linkage  arises  through  localized  technological  spill-overs;  in  other 
words  firms  have  a  higher  propensity  to  invest  if  their  neighbors  invested  in  the  recent  past.  The 
impact  of  this  externality  is  the  focus  of  Durlauf  s  paper.  Due  to  the  externality  Durlauf  s  model 
generates  multiple  long  run  equilibria  in  the  sense  that  the  stochastic  process  for  output  is  non- 
ergodic.  However,  in  our  model,  because  increasing  returns  to  scale  are  internal,  and  this  not  only 
implies  that  the  equilibrium  path  is  uniquely  determined,  i.e.  given  v,  and  {S,.,,St.2,..},  we  know  with 
certainty  in  which  state  the  economy  will  be,  but  also  that  {AY,}  is  always  ergodic. 

6.  Econometric  Evidence 

In  this  section  we  use  the  econometric  results  of  other  researchers  as  well  as  maximum 
likelihood  estimates  of  our  own  general  model  of  Section  4  to  assess  what  scale  of  intertemporal 
increasing  returns  and  what  form  of  heterogeneity  are  necessary  to  account  for  the  business  cycles 
in  the  U.S.  We  also  use  the  analysis  of  Sections  3  and  5  to  infer  from  our  estimates  the  exact  form 
of  U.S.  business  cycle  asymmetries  and  the  relative  importance  of  idiosyncratic  and  aggregate 
sources  of  uncertainty. 

(i)  Regime  Shift  Models 

Hamilton  (1989)  estimates  a  two  state  discrete  Markov  model  for  US  GNP  which  is  nearly 
identical  to  (28)-(30)  and  finds  p=0.9,  q=0.76  and  oc,=1.52.  To  calculate  the  implied  degree  of 
intertemporal  increasing  returns  we  solve  the  equations  in  (A3)  in  the  appendix  using  these  estimated 
parameter  values  and  making  assumptions  regarding  the  distribution  and  variance  of  aggregate  shocks 
plus  the  magnitude  of  the  learning-by-doing  effects  (6,/80)9.  Fig.5  shows  different  combinations  of 
5,/50  and  o2v  which  generate  p=0.9  and  q=0.76  when  aggregate  shocks  are  normally  distributed.  To 
obtain  the  same  persistence  in  St,  a  higher  variance  requires  more  increasing  returns.  The  reason  for 
this  is  that  the  greater  the  variance  of  shocks,  the  more  likely  agents  are  to  receive  a  future  shock  less 
than  (Do-CD,.  This  lessens  the  expected  future  benefits  from  increasing  returns  and  makes  agents  both 
less  likely  to  invest  and  less  likely  to  remain  investing  once  they  have  done  so.  Thus  with  increasing 
returns  of  22%  (8,/80=0.22)  to  produce  the  appropriate  values  of  p  and  q  we  require  a2v=0.05,  but 
when  G^O.Ol  we  need  only  8,/80=ll%.  As  well  as  providing  persistence,  intertemporal  increasing 
returns  generate  considerable  amplification  of  the  productivity  shock.  For  the  case  where  8,/80=0.22 
and  ^=0.05,  even  though  cVai  =  0.03,  the  variance  of  AYt  is  equal  to  0.634.  Relying  only  on  a 


Oq  does  not  influence  the  values  of  cOq  and  go,  and  so  (without  loss  of  generality)  it  is  set  to  ensure  the  model 
matches  mean  US  output  growth. 
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single  productivity  shock  to  drive  output  fluctuations,  we  need  implausibly  large  learning-by-doing 
effects  of  53%  to  explain  Hamilton's  results.  But  with  an  additional  additive  disturbance  in  (1),  as 
assumed  by  Hamilton  and  all  econometric  implementation  of  unobserved  component  models,  his 
results  can  be  explained  with  very  small  amounts  of  intertemporal  increasing  returns  and  aggregate 
uncertainty. 

Results  from  alternative  studies  confirm  this  finding  that  only  small  scale  intertemporal 
increasing  returns  are  necessary  to  generate  empirically  observed  regime  shift  behavior.  Suzanne 
Cooper  (1994)  uses  monthly  industrial  production  from  1931  to  estimate  a  transition  matrix  similar 
to  (28).  To  match  her  estimates  of  the  transition  probabilities  (p=0.55  and  q=0.46)  while  also 
matching  the  variance  of  US  GNP  growth  (again  without  resorting  to  any  additional  productivity 
disturbances  other  than  a  unique  aggregate  shock),  we  need  the  saving  in  fixed  costs  arising  from 
learning-by-doing  to  be  only  around  3%  (i.e.  5[/50=0.03).  Diebold  and  Rudebusch  (1994)  estimate 
equations  analogous  to  (28)  using  US  industrial  production  (allowing  for  a  time  dependent  T). 
Assuming  only  one  disturbance  and  using  their  estimated  standard  error  for  AY,  to  calibrate  ov2  (an 
underestimate  as  this  implicitly  sets  S,=l  for  all  t),  we  find  8,/80=0.8%  is  sufficient  to  explain  their 
results. 

(ii)  The  General  Unobserved  Components  Model 

In  this  subsection  we  use  (15)  and  (16)  to  obtain  estimates  of  our  structural  parameters.  These 
equations  represent  a  general  state  space  model  and  require  assumptions  regarding  the  distribution  of 
idiosyncratic  shocks  before  they  can  be  estimated.  We  examine  these  equations  assuming  first  that 
idiosyncratic  shocks  are  uniformly  distributed  and  then  that  they  are  normally  distributed.  As  well  as 
enabling  us  to  assess  the  robustness  of  our  estimates  of  structural  parameters  these  assumptions  allow 
us  to  examine  the  importance  of  asymmetries  in  U.S.  output  fluctuations.  As  discussed  above,  the 
assumption  of  uniform  idiosyncratic  shocks  removes  any  asymmetry  in  the  state  equation.  Estimating 

(15)  and  (16)  under  different  distributional  assumptions  therefore  enables  a  simple  heuristic  test  of 
the  importance  of  asymmetries  in  U.S.  business  cycles. 

Assuming  that  idiosyncratic  shocks  are  uniformly  distributed  over  the  range  [-a,a],  (15)  and 

(16)  can  be  written  as 

„     a-oin     o>,         v,  (31) 

'  l   2a   1   2a   M  2a 

=c+  rs,_1+  ut 

where  the  coefficients  ty  are  functions  of  the  structural  parameters  u)0,  co„  a,  o^,  and  a,  and  aside  from 
the  squared  disturbance  in  the  measurement  equation  this  is  the  standard  return  to  normality  model 
(e.g.  Harvey  (1989),  ch.3).  Various  versions  of  this  model  have  been  used  to  estimate  the  cyclical 

19 


component  of  U.S.  GNP,  with  Watson  (1986)  and  Clark  (1987)  both  estimating  models  similar  to 
(31)10.  If  instead  we  assume  that  idiosyncratic  shocks  are  distributed  normally,  then"  (taking  second- 
order  Taylor  approximations  for  all  higher  order  terms,  full  details  available  from  the  authors)12; 

Ay,=VV,+V,+V/2+VA/-i 

5r(l-G(»o))+[G(uo)-G!(wo-ui)]5i-i^uo)v/  (32) 

fe(«o-wi)-S(<0o))vA-i 

Equations  (31)  and  (32)  show  that  a  simple  test  for  asymmetry  in  either  the  state  or 
measurement  equation  is  to  test  the  significance  of  the  vtStl  term. 

We  estimate  (31)  and  (32)  using  the  growth  rate  of  quarterly  U.S.  real  GNP  for  the 
period  1954:1  and  1987:4.  Because  the  equations  contain  only  one  shock,  we  augment  the 
measurement  equation  in  each  case  with  a  normally  distributed  additive  measurement  error. 
Estimation  was  performed  using  maximum  likelihood  via  the  Kalman  filter.  We  also  ignored  the 
squared  disturbance  term  which  considerably  simplified  the  estimation.  In  neither  case  did  our 
estimates  reveal  any  indication  of  heteroscedasticity,  suggesting  that  dropping  the  squared  term 
was  not  an  important  omission.  Because  in  (31)  and  (32)  the  measurement  and  state  equation 
have  a  correlated  disturbance  a  simple  alteration  is  required  to  the  standard  recursions  of  the 
Kalman  filter  (see  Harvey  (1989),  p.112).  Table  1  contains  our  estimation  results,  and  Figure 
6  shows  the  (smoothed)  estimates  of  S,  that  emerge  from  the  different  assumptions  on 
idiosyncratic  disturbances.  Both  versions  of  the  model  suggest  that  a  persistent  cyclical 
component  successfully  accounts  for  serial  correlation  in  U.S.  output  growth. 

Examining  the  uniform  distribution  case  (equation  (31)),  we  find  that  the  cyclical 
component  is  persistent,  with  an  autoregressive  coefficient  of  0.52  and  the  model  has  a 
R2=0.595,  thus  explains  just  under  60%  of  the  variation  in  U.S.  GNP  growth.  The  estimates  of 
the  structural  parameters  that  we  uncovered  were  as  follows:  the  variance  of  the  aggregate 
shock,  av2,  was  0.06,  the  variance  of  the  idiosyncratic  shock,  ae  2,  was  1.45.  This  implies  that  the 


10  Watson  and  Clark  estimate  their  model  using  the  level  of  the  U.S.  real  GNP  allowing  for  a  trend,  a  cycle  and  an 
irregular  component.  Thus  our  comparison  is  purely  with  their  specification  of  the  cyclical  component,  our  equation 
(16).  They  both  actually  estimate  the  cyclical  component  as  an  AR(2)  process.  Acemoglu  and  Scott  (1993)  show  this 
can  easily  be  allowed  for  by  letting  intertemporal  increasing  returns  operate  with  longer  lags. 

11  Equation  (32)  is  an  approximation  which  is  valid  for  any  distributional  assumption  regarding  idiosyncratic  shocks. 
Different  assumptions  regarding  G(.)  lead  to  different  estimates  of  the  model's  structural  parameters. 

12  The  exact  expression  for  b4  is  as  follows: 

[/«T°W  "^(M)rf"]x{5(a)o-a)i)-S(wo)} 

btfG(<»($-G(uQ-**j\[l — "°  "'  — MWo-WiteK-^ih^oSK)- 

[G(fc)0)-G(a>0-o1)]2 

This  term  disappears  when  idiosyncratic  shocks  are  uniform. 
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variance  of  idiosyncratic  shocks  is  around  25  times  that  of  aggregate  shocks,  thus  heterogeneity 
being  relatively  much  more  important  than  aggregate  uncertainty.  This  result  is  in  line  with  those 
of  Schankerman  (1991)  and  Davis  and  Haltiwanger  (1992)  from  micro  data  that  idiosyncratic 
variations  are  large  compared  to  comovement  across  all  agents.  Given  our  results  in  the  last 
section,  this  finding  also  explains  why  statistical  tests  of  Hamilton's  (1989)  model  (e.g.  Hansen 
(1992)  and  Garcia  (1992))  reject  the  notion  of  regime  shifts  between  distinct  states  as  an 
appropriate  characterization  of  U.S.  business  cycles.  We  also  recovered  ax  to  be  0.0201,  which 
implies  that  when  the  business  cycle  indicator  S,  is  at  its  peak,  GNP  growth  is  2%  higher 
compared  to  a  trough. 

Finally,  using  our  estimated  structural  parameters  and  assuming  a  real  interest  rate  of 
4%  per  annum  we  can  use  equations  (A3)  to  calculate  estimates  for  50  and  Sx ,  the  learning-by- 
doing  parameters.  We  find  that  S0  =  1.96  and  Sx  =  0.54  implying  intertemporal  increasing  returns 
(SJSq)  of  around  27.6%  (as  a  proportion  of  fixed  costs). 

The  estimates  of  equation  (32)  which  allows  for  asymmetries  reveal  considerably  greater 
persistence  in  the  cyclical  component  —  the  autoregressive  coefficient  in  the  state  equation  is 
now  0.67  as  opposed  to  0.52  in  the  uniform  case.  More  importantly,  R2  in  this  case  goes  up  to 
0.712,  thus  the  model  with  normally  distributed  idiosyncratic  shocks  explains  over  70%  of  the 
variation  in  U.S.  business  cycles.  The  superior  performance  of  this  version  of  the  model  is  also 
significant  as  equation  (31)  is  essentially  what  Clark,  Harvey  and  Watson  have  estimated,  thus 
the  more  general  model  suggested  by  our  theory  that  allows  for  asymmetries  outperforms  the 
conventional  statistical  model. 

Further,  the  interaction  term  between  vt  and  S,.x  is  significant  in  both  the  state  and  the 
measurement  equations,  strongly  so  in  the  former,  confirming  the  asymmetric  nature  of  U.S. 
business  cycles  and  thus  indicates  where  the  improved  performance  of  (32)  comes  from.  Figure 
6  shows  the  estimated  sequence  for  the  cyclical  indicator  arising  from  assuming  normal  and 
uniform  idiosyncratic  distributions.  The  strong  correspondence  between  the  two  sequences  is 
encouraging  since  it  implies  that  we  are  basically  uncovering  the  same  underlying  component 
in  both  exercises.  However,  there  is  a  major  difference  between  the  two  series:  the  asymmetric 
model  enables  much  sharper  downward  swings  into  recession  and  it  is  this  feature  of  the  data 
which  accounts  for  the  success  of  the  more  general  model.  These  sharper  downswings  are 
captured  by  the  positive  coefficient  on  the  v,  Sul  term  and  are  in  line  with  the  findings  of  Neftci 
(1984)  who  used  a  different  statistical  methodology  than  the  one  here. 

Again  uncovering  our  structural  estimates,  we  find  that  ax  =  0.017  which  is  less  than  the 
estimate  of  the  growth  difference  between  boom  and  recession  that  we  obtained  from  (31). 
However,  note  that  this  does  not  imply  growth  to  be  higher  by  only  1.7%  at  the  peak  of  the 
cycle  because  in  contrast  to  (31),  we  also  have  the  added  flexibility  of  having  the  asymmetry 
term  vtS,_x.  The  presence  of  this  term  implies  that  as  long  as  we  remain  in  a  boom  (i.e.  positive 
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values  of  v,)  we  obtain  an  added  growth  effect  and  this  is  of  the  order  of  0.2%,  thus  at  the  peak 
of  the  cycle  growth  is  higher  by  1.9%.  But  also  this  term  implies  that  when  there  is  a  downturn 
(v,<0),  this  can  happen  rather  sharply.  Returning  to  the  rest  of  the  estimates,  av2 was  0.035  and 
cre2  was  0.1207,  so  our  estimate  of  the  ratio  of  the  variance  of  idiosyncratic  shocks  to  aggregate 
shocks  is  around  3.5,  considerably  less  than  the  approximate  estimate  of  10  from  Schankerman 
(1991)  and  Davis  and  Haltiwanger  (1992),  though  still  giving  a  substantial  role  to  idiosyncratic 
disturbances  in  generating  uncertainty.  The  fact  that  our  asymmetric  model,  (32),  leads  to  a 
large  estimate  of  the  relative  variance  of  aggregate  shocks  may  be  due  to  the  absence  in  our 
model  of  any  interactions  across  individual  investment  decisions.  As  a  result,  what  may  be  firm 
specific  effects  translated  into  co-movements  by  economic  interactions  across  agents  will  be 
estimated  as  aggregate  shocks  in  our  specification.  Examining  whether  this  is  actually  the  case 
clearly  requires  a  richer  data  set  than  the  aggregate  time  series  we  use  here.  However,  our 
estimates  show  that  even  without  spillover  effects  we  can  have  a  model  where  idiosyncratic 
shocks  are  the  major  source  of  uncertainty  yet  output  growth  shows  a  clear  cyclical  pattern  with 
sharp  swings  into  recession.  Resorting  again  to  (A3)  we  find  that  our  asymmetric  model  leads 
to  estimates  of  S0  and  Sl  of  1.81  and  0.63  respectively,  so  that  intertemporal  effects  of  around 
1/3  are  required  to  explain  U.S.  fluctuations. 

It  is  also  interesting  to  note  that  the  increased  complexity  of  (32)  implies  that  our 
structural  parameters  are  overidentified  (see  for  instance  the  expression  in  footnote  11)  whereas 
they  were  exactly  identified  in  (31).  This  gives  us  two  overidentifying  restrictions  with  which  to 
test  the  plausibility  of  our  model.  Performing  a  Wald  test  on  these  restrictions  gives  a  test 
statistic  of  4.1  which  is  asymptotically  distributed  chi-squared  with  two  degrees  of  freedom,  thus 
comfortably  accepting  the  overidentifying  restrictions.  This  result  is  promising  as  it  is  another 
piece  of  evidence  that  our  model  provides  a  good  match  to  the  cyclical  dynamics  of  U.S.  GNP. 

7.  Conclusion 

We  have  outlined  a  theory  of  economic  fluctuations  based  on  internal  intertemporal 
increasing  returns  in  a  model  of  discrete  investment  choice.  This  model  is  motivated  by 
microeconometric  findings  of  persistence  in  firm  level  investment  as  well  as  the  emphasis  placed 
in  the  management  science  and  economics  of  technology  literature  on  the  importance  of 
learning-by-doing  in  the  adoption  of  new  technologies.  Incorporating  these  effects  into  a  model 
of  a  firm's  investment  choice  leads  to  a  tractable  model  of  business  cycle  fluctuations. 

This  tractability  enables  us  to  fully  analyze  the  determinants  of  aggregate  economic 
fluctuations.  Our  theoretical  findings  are:  (i)  Intertemporal  increasing  returns  naturally  lead  to 
temporally  agglomerated  and  asymmetric  economic  fluctuations,  in  the  sense  of  persistent 
periods  of  low  and  high  growth  separated  by  business  cycle  turning  points  (ii)  Cross-sectional 
considerations  play  a  key  role  in  determining  the  non-linear  and  asymmetric  nature  of 
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fluctuations  and  distributional  issues  are  found  to  be  crucial  in  determining  the  sharpness  of 
turning  points  and  deciding  whether  aggregate  output  experiences  regime  shifts,  (iii)  Although 
heterogeneity  plays  a  key  role,  all  the  business  cycle  relevant  information  is  captured  in  one 
variable,  the  average  number  of  active  agents,  thus  our  model  enables  a  synthesis  between 
representative  agent  models  and  those  stressing  the  importance  of  heterogeneity. 

A  major  attraction  of  our  model  is  its  ability,  under  certain  simplifying  assumptions,  to 
justify  the  popular  unobserved  component  models  of  Watson  (1986),  Clark  (1987)  and  Hamilton 
(1989)  all  of  which  place  a  special  emphasis  on  an  underlying  cyclical  indicator.  An  additional 
attraction  is  the  model's  ability  to  offer  a  general  formulation  for  cyclical  components,  and 
provide  a  simple  test  for  the  presence  of  business  cycle  asymmetries.  The  data  suggest  that  our 
general  model  that  incorporates  asymmetric  effects  provides  a  good  fit  to  U.S.  business  cycles; 
that  these  are  characterized  by  asymmetries,  with  particularly  fast  downturns  that  cannot  be 
captured  by  a  linear  model;  that  the  cyclical  component  is  highly  persistent;  and,  that  despite 
the  asymmetries,  business  cycles  are  not  characterized  by  sharp  swings  such  as  required  by 
regime  shift  models.  In  our  context,  this  latter  fact  corresponds  to  the  aggregate  sources  of 
uncertainty  being  dominated  by  idiosyncratic  shocks,  which  is  what  investigations  on  micro  data 
also  find.  Estimates  of  the  degree  of  internal  intertemporal  increasing  returns  necessary  to 
account  for  U.S  business  cycles  vary  between  3  and  53%,  suggesting  that  under  some 
circumstances  only  very  modest  amounts  of  intertemporal  increasing  returns  are  needed.  Adding 
other  sources  of  uncertainty  to  the  model  or  allowing  for  spillovers  between  agents  would  serve 
to  reduce  even  further  the  extent  of  internal  increasing  returns  required.  Assessing  the  relative 
importance  of  internal  and  external  intertemporal  increasing  returns  is  an  obvious  topic  of 
further  research.  At  this  stage,  our  results  lead  us  to  conclude  that  intertemporal  increasing 
returns  may  be  an  important  channel  of  persistence,  amplification  and  asymmetries  in  economic 
fluctuations. 
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Figure  2  :  5,/50=0.33  a2v=0.25  a1  =0.25  a2s=0.064  a%=0.841  a\=0.025 
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Figure  3  :  5,/50=l/2  a2v=0.25  a2  =0.25  a2s=0.014  a2  =0.541  a2,=0.032 
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Figure  4  :  6,/50=l/2  ^=0.25  a2  =2.5  a2s=0.012  ^=4.855  a\=0.001 
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Figure  5  :  Learning-by-Doing  and  Volatility  of  Aggregate  Shocks 
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Figure  6  :  The  Cyclical  Component  of  U.S.  GNP  Growth 
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Figure  6  :  The  Cyclical  Component  of  U.S.  GNP  Growth 
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Table  1  :  Estimates  of  General  and  Uniform  Model 


Uniform  Case 

Normal  Case 

Parameter 

Estimate 

Tstatistic 

Estimate 

Tstatistic 

bo 

0.0004 

1.691 

0.0020 

1.957 

bt 

0.0201 

3.107 

0.0173 

3.284 

b4 

0.1027 

1.985 

c 

0.1826 

2.232 

0.1769 

2.173 

T 

0.5213 

5.669 

0.6653 

6.339 

K 

0.3229 

3.496 

°u2 

0.0070 

2.703 

0.0065 

2.842 

R2 

0.595 

0.712 

Sample  period  1957:1,1987:4.  Data  =  Quarterly  growth  in  U.S  GNP.  Estimation  by 
Maximum  likelihood  via  Kalman  filter. 
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Appendix 

Proof  of  Proposition  1 

We  need  to  establish  that  (i)  the  value  function  defined  in  (8)  satisfies  (7)  for  particular 
values  of  a>0  and  (Oj  and  <f>'s  (ii)  the  value  function  satisfies  a  transversality  condition. 
To  establish  (i)  we  substitute  (8)  into  (7)  to  derive: 

I/u^Wq-w^^,  then 

00 

+/5[  /  (4>Q+<l>Jiyl-i**o+"l+uWs+il>uu*i)dflu>i)  (Ai) 

>0-G>l 

/  (0o4^//n+«o+ai+^)d^l'M)] 


+ 


where  a  similar  equation  applies  for  vt<  (Oq-co^!.  Equating  coefficients  gives 

I^-V^.   /  ^("m)+A   /  "m^("m)  (a2) 

/  P  0)0-0);  HU0-"l 

^o-^o= ~ 

1-/8  /  <*F(u,+1) 

OJQ-O)! 

To  find  expressions  for  (o0  and  (Oj  we  use  (8)  to  see  under  what  conditions  the  left  hand  side 
of  (5)  is  greater  evaluated  at  s,=l  than  evaluated  at  s,=0.  This  gives 

0)^(1-0) 

0)0 

G)0=[l-)3     /     dF{ut^Y.\{\-P)h,-*,- 


ao-S^l-P) 


(A3) 


Defining  the  right  hand  side  of  the  definition  of  o>0  as  t(g)0)  we  need  to  prove  t(g)0)  has  a  fixed 
point.  Defining  z(u)  as  t(u)-u  we  have 

limo,-+=  z(«)=-°°    and    lim^.^  z(o>)=+oo  (A4) 

Continuity  of  z(u)  follows  from  continuity  of  t(.)  and  by  the  intermediate  value  theorem  z(u) 
must  have  a  zero.  Thus  a  fixed  point,  a)0,  of  t(.)  exists. 
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(ii)  The  value  function  defined  in  (8)  satisfies  a  transversality  condition  iff 

oo 

Km^jS'/  K«M.'M.«0<ffW=°  (A5) 


since  V(.)  is  linear  condition  is  satisfied  for  B<1. 

Establishing  uniqueness  of  <o0  requires  proving  z(o)0)  has  a  unique  fixed  point.  z(co0)  is 
everywhere  differentiable  and  its  derivative  equals 

00 

zK)=iW(<>-w/K-(l-WMli3     /     dF(ut+l)Yl 

oo  "0 

+[(1-/3)50-^^(1-^      /     ^(uM)-^     /     ul+1dF(u„)]  (A6) 

u0-(l-P)Sj  o>o-(l#i 

<->o 

xj8[/-(Q0)^(o)0-(l-)8)51)][l-i8     /     dF(«M)]-2-l 

"o-(l-^)«i 


Substituting  z((o0)=0,  we  have  z'(a)0)=-l  which  establishes  that  z(o)0)=0  can  only  be 
true  at  a  unique  value  of  <d0. 

We  now  establish  by  contradiction  that  there  can  be  no  non-linear  solutions  to  the 
recursion  (7).  Let  W^^s^Um)  be  a  solution  to  (7)  and  for  given  values  of  the  state 
variables  let  stw  be  the  optimal  choice  of  st.  Thus 

(A7) 

J%l+V(ai+Mi><V.«wWMM) 


+ 
-oo 


Observation  1:  ^"(y^s^.u,)  cannot  depend  on  y,.!  as  returns  from  a  higher  yt.x  accrue 
under  both  s,w=0  and  stw=l. 

We  know  by  assumption  that  both  V(.,.,.)  in  (8)  and  W(.)  satisfy  (7).  Take  a  value  of 
u,  where  V(.)  gives  stv=0.  The  difference  between  (5)  evaluated  at  W(.)  and  V(.),  for  given 
u,,  is 
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^{-B/RO'M-»«0+(a1+i0*r.*,,r,«Vi)  ,AO, 

<A8> 

-/(*0^JC^M^^^M)^O-/(^jC^-l+ab))^l,M)> 

From  Observation  1  this  holds  for  all  y,.!  for  fixed  u„  implying  that 

Thus  W(.,.,.)  is  linear  in  y,.!  with  a  constant  coefficient.  To  see  this  note  that  we  can 
repeat  this  exercise  for  s,.x=0  or  1  and  get  the  same  coefficient  kt.  Similarly  we  can  repeat 
the  exercise  for  values  of  u,  at  which  stv=  1  and  get  the  same  relationship. 
Observation  2:  If 

E,T^  ^{yt.^t._^t.  \srl)>E^  i*r(yt^H_vuN  |*,=0)  (A10) 

is  true  for  u,=u,  it  is  also  true  for  u,=u+A,  such  that  A>0.  The  left-hand  side  of  (A10)  is 
increasing  in  ut  whereas  the  right-hand  side  does  not  depend  on  ut.  This  implies  there  exists 
a  value  of  u„  u^s,^)  (independent  of  a,.!  by  Observation  1)  such  that  ut>(<)uw(st.1)  implies 
that  stw=l  (=0).  Suppose  u^Sn^Oo-^iSt-i  (the  argument  for  the  reverse  inequality  is 
analogous). 

First,  consider  u,  in  the  interval  (-oo^o-cojSi.j).  F°r  aH  ut  m  this  interval,  s,w=stv=0. 
Varying  u,  in  this  interval  gives: 

^(yM^",)-/^(y,-i+aoM+i)=*2  <A11) 

where  Kj  does  not  depend  upon  u,.  As  there  is  no  investment  we  see  from  Observation  2 
that  W(.)  is  independent  of  u,  in  the  interval  (-00,0^0)^). 

Consider  a  value  of  u,  in  the  interval  (u^Sn^+oo).  This  implies  stv=stw=l. 
Considering  variations  of  u,  in  this  interval  gives 

%i^,)-^%i+v«iV.vJ=Wu-^>,  (A12) 

However  W(.,.,.)  is  linear  in  at.v  which  is  linear  in  u,.!  with  coefficient  <f>u,  thus 

Thus  over  this  range  too  W(.,.,.)  is  linear  in  u,,  with  coefficient  <£u=l/(l-j3). 

We  also  need  to  consider  the  interval  ((Oo-OjSn.v^s,.!))  in  which  s,.1w=0  and  W,  does 
not  depend  on  u,  over  (coo-ajjSujU^s,.!)).  Thus  the  coefficient  of  u,  of  the  value  function 
W(.,.,.)  has  the  same  form  as  the  value  function  in  (7)-(8). 
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Finally  we  need  to  check  how  W(.)  is  influenced  by  s,.x.  However  this  variable  only 
takes  the  values  0  and  1.  Thus  any  general  function  h(st.x)  can  be  written  as  k4st.!,  implying 
any  value  function  that  satisfies  (7)  must  have  the  same  form  as  V(.).   QED 
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